Robust word recognition using articulatory trajectories and gestures

نویسندگان

Vikramjit Mitra

Hosung Nam

Carol Y. Espy-Wilson

Elliot Saltzman

Louis Goldstein

چکیده

Articulatory Phonology views speech as an ensemble of constricting events (e.g. narrowing lips, raising tongue tip), gestures, at distinct organs (lips, tongue tip, tongue body, velum, and glottis) along the vocal tract. This study shows that articulatory information in the form of gestures and their output trajectories (tract variable time functions or TVs) can help to improve the performance of automatic speech recognition systems. The lack of any natural speech database containing such articulatory information prompted us to use a synthetic speech dataset (obtained from Haskins Laboratories TAsk Dynamic model of speech production) that contains acoustic waveform for a given utterance and its corresponding gestures and TVs. First, we propose neural network based models to recognize the gestures and estimate the TVs from acoustic information. Second, the “synthetic-data trained” articulatory models were applied to the natural speech utterances in Aurora-2 corpus to estimate their gestures and TVs. Finally, we show that the estimated articulatory information helps to improve the noise robustness of a word recognition system when used along with the cepstral features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognizing articulatory gestures from speech for robust speech recognition.

Studies have shown that supplementary articulatory information can help to improve the recognition rate of automatic speech recognition systems. Unfortunately, articulatory information is not directly observable, necessitating its estimation from the speech signal. This study describes a system that recognizes articulatory gestures from speech, and uses the recognized gestures in a speech recog...

متن کامل

Mapping between acoustic and articulatory gestures

We propose a method for Acoustic-to-Articulatory Inversion based on acoustic and articulatory ‘gestures’. A definition for these gestures along with a method to segment the measured articulatory trajectories and the acoustic waveform into gestures is suggested. The gestures are parameterized by 2D DCT and 2D-cepstral coefficients respectively. The Acoustic-to-Articulatory Inversion is performed...

متن کامل

Analysis of coarticulated speech using estimated articulatory trajectories

Speech acoustic patterns vary significantly as a result of coarticulation and lenition processes that are shaped by segmental context or by performance factors such as production rate and degree of casualness. The resultant acoustic variability continues to offer serious challenges for the development of automatic speech recognition (ASR) systems. Articulatory phonology provides a formalism to ...

متن کامل

Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data

We describe a self-organising pseudo-articulatory speech production model (SPM) trained on an X-ray microbeam database, and present results when using the SPM within a speech recognition framework. Given a time-aligned phonemic string, the system uses an explicit statistical model of co-articulation to generate pseudoarticulator trajectories. From these, parametrised speech vectors are synthesi...

متن کامل

Inductive Bias against Stem Changes as Perseveration: Experimental Evidence for an Articulatory Approach to Output-Output Faithfulness

Speakers of morphologically-rich languages commonly face what has been called the Paradigm Cell Filling Problem: they know some form of a word but it is inappropriate to the current context, leading them to derive a form of that word they have never encountered (e.g., they know the singular form of a noun, and now need to produce the plural). We suggest that in performing this task speakers per...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Robust word recognition using articulatory trajectories and gestures

نویسندگان

چکیده

منابع مشابه

Recognizing articulatory gestures from speech for robust speech recognition.

Mapping between acoustic and articulatory gestures

Analysis of coarticulated speech using estimated articulatory trajectories

Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray data

Inductive Bias against Stem Changes as Perseveration: Experimental Evidence for an Articulatory Approach to Output-Output Faithfulness

عنوان ژورنال:

اشتراک گذاری